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REVERSE TWO-HYBRID METHOD 

5 

TFrHNTrAL FIELD 

The invention relates to methods and compositions for identifying agents 
which modify intermolecular association between two or more polypeptides. Agents which 
10 specifically inhibit such protein-protein interactions are suitable for use as commercial 

reagents, pharmaceuticals, and for modulating gene expression in a cell culture and/or ammal. 
such as to increase or decrease the expression of a predetermined protein in the cell culture or 
animal, and the like. 



RArKfiROUND 

Specific protein-protein interactions are fundamental to most cellular and 
organismal functions. Polypeptide interaaions are involved in formation of functional 
transcription complexes, signal transduction paAways. cytoskeletal organization (e.g.. 
microtubule polymerization), polypeptide hormone receptor-ligand binding, orgamzafon of 
multi-subunit enzyme complexes, and the like. 

Investigation of protein-protein interactions under physiological conditions has 
been problematic. Considerable effort has been made to identify proteins that bind to proteins 
of interest. Typically, these interactions have been detected by using co-precipitation 
experiments in which an antibody to a known protein is mixed with a cell extract and used to 
precipitate the known protein and any proteins which arc stably associated with it. Th^s 
method has several disadvantages, such as: (1) it only detects proteins which are associated m 
cell extract conditions rather than under physiological, intracellular conditions. (2) tt only 

detects proteins which bind to the known protein with sufficient strength and stabUity for 

efficient co-immunoprecipitation. and (3) it fails to detect associated proteins which are 
, displaced from Ae known protein upon antibody binding. For these reasons and others, 

improved methods for identifying proteins which interaa with a known protein have been 

developed. 

Tn.n-HY*''''^ Svstems 
One approach has been to use a so<aIled "two-hybrid' system to identify 
J polypeptide sequences which bind to a predetermined polypeptide sequence present in a fusion 
protem (Chien et al. (1991) Prnr N.tl Acd S.j. (USA) fig: 9578). Tl^is approach identifies 
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protein-protein interaaions in yivo through reconstitution of a transcriptional activator (Fields 
S and Song O (1989) Nature 340: 245). the yeast Gal4 transcription protein. The method is 
based on the properties of the yeast Gal4 protein, which consists of separable domains 
responsible for DNA-binding and transcriptional activation. Polynucleotides encoding two 
5 hybrid proteins, one consisting of the yeast Gal4 DNA-binding domain fused to a polypeptide 
sequence of a known protein and the other consisting of the Gal4 activation domain fused to a 
polypeptide sequence of a second protein, are constructed and introduced into a yeast host 
cell. Intermolecular binding between the two fusion proteins reconstitutes the Gal4 DNA- 
binding domain with the Gal4 activation domain, which leads to the transcriptional acUvation 
10 of a reporter gene (e.g.. lacZ, HISS) which is operably linked to a Gal4 bindmg site. 
Typically, the two-hybrid method is used to identify novel polypeptide sequences which 
interact with a known protein (SUver SC and Hunt SW (1993) Mo| . Biol. Rep. 12: 155; 
Durfee et al. (1993) £z£ne^DeveL 2; 555; Yang et al. (1992) SsiSDCfi 252: 680; Luban et al. 
(1993) £eU H: 1067; Hardy et al. (1992) Genes Pevel. fi; 801; Bartel et al. (1993) 
15 Rintechniaues 14: 920; and Vojtek et al. (1993) CeU 24: 205). However, variations of the 
two-hybrid method have been used to identify mutations of a known protein that affect its 
binding to a second known protein (Li B and Fields S (1993) fASlEl. 2' 957; Udo et al. 
(1993) Prnr N«tl. Acad Sci. fUSA-> 2Q: 5524; Jackson et al. (1993) MqI . Cell, B'iqI 12: 
2899; and Madura et al. (1993) .1 Biol. Chem. 268: 12046). Two-hybrid systems have abo 
20 been used to identify interacting structural domains of two known proteins (Bardwell &. al. 
(1993) Microbiol. &: 1 177; Chakraborty et al. (1992) T , Biol, Chem. 2fi2: 17498; 
Staudinger et al. (1993) T Binl. Chem. 2^: 4608; and MUne GT and Weaver DT (1993) 
Genes Devel. 2; 1755) or domains responsible for oligomerization of a single protein 
awabuchi et al. (1993) Oncogene S; 1693; Bogerd et al. (1993) LViloL £Z: 5030). 
25 Variations of two-hybrid systems have been used to study the ia viYfi activity of a proteolytic 
enzyme (Dasmahapatra et al. (1992) Proc N.tl Acad. Sci, fU?A) 4159). Alternatively, 
an E. coli/BCCP interactive screening system (Germino et al. (1993) Prnr Natl . Agad. S«. 
aj.S.A.-> 2Q: 933; Guarente L (1993) Pror Natl Acad. Spj. (U.S.A.) 2Q: 1639) can be used 
to identify interacting protein sequences O-e.. protein sequences which heterodimerize or form 
30 higher order heteromultimers). 

Each of these two-hybrid methods rely upon a positive association between 
two Gal4 fusion proteins thereby reconstituting a functional Gal4 transcriptional activator 
which then induces transcription of a reporter gene operably linked to a Gal4 binding site. 
Transcription of tiie reporter gene produces a positive readout, typically manifested either (1) 
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as an enzyme activity (e.g.. ^-gaJactosidase) that can be identified by a colorimetric enzyme 
assay or (2) as enhanced cell growth on a defined medium (e.g.. WSJ), nius. these methods 
are suited for identifying a positive interaction of polypeptide sequences, but are poorly su.ted 
for identifying agents or conditions which alter (e.g.. inhibit) intermolecular assoc.auon 
5 between two polypeptide sequences. In part, this is because a faUure to obtam 

expression of the reporter gene can result from many events which do not stem from a 
specific inhibition of binding of the two hybrid proteins. For example, a two-hybrid system 
using a reporter gene that stimulates growth under defined conditions theoretically can be used 
to screen for agents that inhibit the intermolecular association of the two hybrid protems. but 
10 it will be difficult or impossible to discriminate agents that specifically inhibit the associauon 
of the two hybrid proteins from agents which sin^ply inhibit cell growth. TTius. an agent 
which is cytotoxic to yeast (e.g.. bleach, phenol, ketoconazole, cydoheximide) wUl prevent 
cell growth without specifically inhibiting the interaction of two hybrid proteins and wdl score 
falsely as a positive hit. Similarly, a conventional two-hybrid system using a lacZ reporter 
15 gene will falsely score general transcription or translation inhibitors (e.g.. cyclohexumde) as 
being inhibitors of two hybrid protein binding. TTius, two-hybrid systems that produce a 
positive readout contingent upon intennolecular binding of the two hybrid protems are 
generally not suitable for screening for agents which inhibit binding of the two hybnd 
proteins. 

Oft Unfortunately, it would be desirable to have an efficient screening method for 

identifying compounds which specificdly alter the intermolecular association betw^^ 
known polypeptide sequences under physiological conditions. Present two-hybrid methods 
rely on a positive readout and do not afford a method for identifying binding inhibitors (or 
binding competitors) with satisfactory sensitivity and/or selectivity. 
25 nius. there is a need in the art for compositions and methods which can be 

used to efficienUy identify agents that specifically alter the mtermolecular association between 
two polypeptide sequences in mSl. present invention fiilfills these and other needs. 

The references discussed herein are provided solely for their disclosure pnor 
tothefilingdateofthepresentapplication. Nothing herein is to be construed as an admission 
30 thattheinventorsarenotentitiedtoantedatesuchdisclosurebyvirtueofpriorinvention. All 
publications and patent applications herein are incorporated by reference to the same extent as 
if each individual publication or patent application was specifically and individually indicated 
to be incorporated by reference. 
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-^TTMIf.^^Y ThA/FNTION 

The present invention provides several novel methods and compositions for 
identifying agents which alter intermolecular binding between two polypeptide species in a cell 
or in a cell-free transcription reaction. Tlie invention relates to a general method, referred to 
5 herein as a reverse two-hybrid method, wherein agents which disrupt an intermolecular 
association between two interacting polypeptides thereby generate a selecuble and/or 
detectable readout (e.g.. complementation of an auxotrophic phenotype, expression of a 
detectable reporter molecule, and the like). Typically, a reverse two-hybrid method produces 
a positive readout under conditions wherein an agent blocks or otherwise inhibits the 
10 intermolecular binding of the interacting polypeptides. A positive readout condition is 

generally identified as one or more of the following detectable conditions: (1) an increased 
transcription rate of a predetermined reporter gene. (2) an increased concentration or 
abundance of a polypeptide produa encoded by a predetermined reporter gene, typically such 
as an enzyme which can be readily assayed in vivo, and/or (3) a selectable or otherw«e 
15 idenUfiable phenotypic change in an organism (e.g., yeast) harboring the reverse two-hybnd 
system Generally, a selectable or otherwise identifiable phenotypic change that characterizes 
a positive readout condition confers upon the organism either: a selective growth advantage on 
a defined medium, a mating phenotype. a characteristic morphology or developmental stage, 
drug resistance, or a detectable enzymatic activity (e.g.. ^-galactosidase. luciferase. alkalme 
20 phosphatase, and dae like), in this manner, it is possible to efficiently identify agents 
Cincluding but not limited to polypeptides, small molecules, and oligonucleotides) which 
inhibit intermolecular binding between two predetermined interacting polypeptides. 

In an aspect of die invention, a reverse two-hybrid system is composed of: (1) 
a first hybrid protein. (2) a second hybrid protein which binds to the first hybrid protein under 
25 control conditions (e.g.. physiological conditions in the absence of agent), (3) a relay (or 

signal inverter) gene which is efficientiy expressed as a consequence of the first hybrid protem 
and the second hybrid protein being functionally bound to each other, and (4) a reporter gene 
which is efficientiy expressed when the product of die relay (or signal inverter) gene 
substantially absent and is either pooriy expressed or not expressed when the relay (or signal 
30 inverter) gene product is efficientiy expressed. TTie first hybrid protein and second hybrid 

protein bind to each otiier tiirough interacting polypeptide segments O-e., a poruon of the first 
hybrid protein preferentially binds to a portion of the second hybrid protein forming a 
heterodimer or higher order heteromultimer comprising the first and second hybrid proteins; 
said binding portions of each hybrid protein are termed -mteracting polypeptide segments"). 
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Tlie first hybrid protein comprises: (1) a first interaaing polypeptide sequence 
in polypeptide linkage with (2) a DNA-binding domain of a transcriptional activator protein or 
other DNA binding protein (e.g., a repressor). The second hybrid protein comprises: (1) a 
second interacting polypeptide sequence, capable of forming an intermolecular association with 
5 the first interacting polypeptide sequence under control conditions (e.g.. physiological 
conditions and absence of agent) in polypeptide linkage with (2) an activation domain of a 
transcriptional activator protein, whereby intermolecular binding between the first hybrid 
protein and the second hybrid protein (via the interacting polypeptide sequences) thereby 
unites the DNA-binding domain of the first hybrid protein with the activation domain of the 
10 second generating a transcriptional activator function. Generally, the first hybrid protein and 
the second hybrid protein are encoded by polynucleotides which are constitutively expressed in 
a host organism (e.g., a eukaryotic or prokaryotic cell, or multicellular organism). 

The relay gene (alternatively termed the signal inverter gene) is operably 
linked to a transcriptional regulatory sequence (a "relay transcriptional regulatory sequence") 
15 which is positively regulated by the transcriptional activator that is formed by the 

intermolecular binding of the first hybrid protein to the second hybrid protein. Hence, when 
the first hybrid protein binds to the second hybrid protein (via the interarting polypeptide 
sequences), the transcriptional activator formed thereby binds to a transcriptional regulatory 
sequence operably linked to the relay gene and enhances the net transcription of the relay 
20 gene. The relay gene encodes a protein that represses transcription of a reporter gene. Thus, 
when the first and second hybrid proteins are functionally bound to each other, the relay gene 
is expressed and thereby represses transcription of the reporter gene(s). In an embodiment, 
such relay proteins are of the type often referred to in the art as "negative regulators of 
transcription". In an embodiment of the invention, the relay gene is a negative regulator of 
25 transcription in yeast; for example but not limitation the GAL80 gene can serve as a relay gene 
in yeast. In embodiments where host organisms are employed to harbor the reverse two- 
hybrid system, the relay gene is often a gene which naturally occurs in the germline DNA of 
the host organism species, and frequenUy can be an endogenous germline gene, or 
alternatively may be introduced into the host organism as exogenous DNA, typically into a 
30 host genome that lacks the corresponding funaional endogenous gene (e.g.. a "knockout 
background"). 

The reporter gene is operably linked to a transcriptional regulatory sequence 
("reporter transcriptional regulatory sequence") which is negatively regulated by the gene 
produa of the relay gene and which is induced in the absence of the relay gene produa. 
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r.^. ^anscripuoo of reponer is repressed in co„.o, conditions (-S- P"^^"""^'"' 
liuons in .he absence or a.enO wherein*, two n,.riapro«i„s bind «,ea^2^^^^^ 

fom, a transcriptional activator that increases transcription of the relay gene. O ne .1,^.^ 
,e.ay ,ene prodnc. either binds u, the transcriptional regulatory sequence operabl, ..n.ed^ 
5 1 e^ru gene, or binds to a transcription protein that binds to the transcpfonal regulatory 
r^^loperabl lln^edtotbereportergene. net transcrip.on rate of t.e reporter g^ 

s reduced or completely bloclced) as a consequence of the relay gene product bntdtng to the 
err-sl^anscriptional regulatory sequence and/ortoat.an3cr,p.ionpro.e,„re.,u.^^r 

i::ti:ee,pressionoft.ereportergene. ^-^^ ^ ' ^-'"'^ 

A t.,n housed For example and not limitation, suitable reporter genes are 
in nositive readout can be usea. rui caoiui*. ^ . o 

:o:wh,cha)confer.selec.ablepheno.«»tocellsin.hichtberepor.ergene.e«^^ 

expressed, and/or P) encode a gene product (..g.. e„zyn», wbicb « 
su b as by in ^ assay or the liVe. Suitable genes wbicb confer a selectable phenoty^ « 
exen^lifiL by. but no. limited to. genes wbicb completnen. auxotrophic mutaucns u. a ^. 
15 gaIs.n(eI,east«S^.geneswbicb««>dedrugresis.anc.(e.g..^.S..-^^^^ 
U,L cell proliferadon. and other genes whose expression confen . selecu" row* 
advantage Suitable g«»s wbicb encode a gene product which is convementiy detected ui .m. 
":Zn«^hy. bu.no.lin.i.ed.0, ,-galac.osidase (e.g.. E.^^- ~ 

phosphatase, horseradish peroxidase, and the like. ^.^ .■ 

,„ -^.e invention provides polynucleotides encoding, first hybndproteu. and a 

second hybrid protein. Such polynucleotides encode . DNA-bindh,g domain ac«v^n 
dotnainJf.uanscripUonalacUvator and conveniently can baveacloningsttefor^^ 

.osertion. in reading f.an«. of polynucleotide sequences encoding one or more mte actn^ 
polypep.;dese,uence(s,.T,pically.a«rstpolynuc.ecUdewnUnco*^ 

of a firs, predetenni^d interacting polypepUde se<,.enc and a 
dod of a transcriptional acUvator; a second polynucleotide wBl encode a second bybnd 
protein composed of a second predetermined interacting polypeptide sequence » 
Lva^o. domain of a transcriptional acUvator. wherein the DNX-bindmg domain of the fir« 
^b rpltein can reconsUnate with the activation domain and form a Actional „t.o«al 
111: Often, the DNA-binding domain and the activauon domain of the hy...dp^^^ 

pair are derived from d,e same namral.y occurring transcripUon activator (e.g Gd^ 

However, those of skill in the ar. can select DNA-binding domains and acuvat, n domam^ 

from dist^cttranscriptionacuvators Which can reconstitutetofbrmafunc^n^^^^^^ 

- o rjMA-hindine domam of the bacterial lexA 
activator which does not occur m nature (e.g.. a DNA bmdmg ao 



25 



30 
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™ c» b. used in conjunction wiU, a «a„scnpUona. activator from the vM protein, 
V,6.VoJ,c.«=i.(.993,^.Ttan.cHptionana.,an.Utiono,suchapo.,nu^^^^^ 
ptodnces a hybrid (or ft^ion) protein composed of an interacting poiynuCeofde segment and a 
DNA-binding dontain or activation domain of a transcriptional ac«v«or. 

m invention also provides poiynuciectides whicb comprise a transcnptto.^ 

regulatory sequence operably linked ,0 a relay (or signal inverter) gene. A relay (or stgoa. 
^^rter) genlncodes aproteinthat inhibits or o,h«wise represses express,on (typtcall. 
Lei tL) of a predetermined reporter ge,». Most usually, a relay protem » a negauve 
ofLansc^lon^rapredetermined gene or gen. subset., an 

,0 rl protein is a transcription repressor protein that binds to a polynucleotide sequence a^ 
rebylnbibitstranscriptionofacis-linltedandoperablylinltedscuence. - — 
IbodLn..,.. relay protein binds toaprouin that isaposUive.egula.rof.r„^^ 
apredetermined gene or gene subset. ar^asaconsequenco,bind.ngth„ebymh,b.»d» 

.lcriptional.cUvi.yof,bepositiveregula.or. ^"'^ °' '^^^ ':ZT 
,5 and blocks the activation domain(s) of transcriptional activators. Although a var,«y of 
:Irl*pro,elnsareappare...u>.hose„fsmin.h.ar..hi.ca.egoryofre.aypro.e.^ 
rb^^^em^^fiea by the manunalian^oncoprotei,..hich binds the t.^^^ 
domain of the turnor suppressor protelnp^. and th. yeas. Ga.«Oproteu,.,chb,^s«^ 
i^vates the acUvation domain of Ga.4. m an embodiment *. relay protem comprtse, a 
.0 mutation, addition, or deleUo. that reduces the stabUity of the re.., protem « ^ « 
:mpa«;tothen.tura,„occurringcognatere.aypr«ein. "-"V P— ^ 
as signal inverter proteit». as they serve to invert . posiUv. transcrtpuonal s.gna. (.h. 
::^ll„fafimctiona..rans«ipUonal activator by bindirtg of «.e«^ 
■mto .negauvetranscript^na. signal, which ^e-- transcription of . pr.d~dr^r« 
^ gene. « a polymtcleoUde encoding a relay protein U operably luOted «, a ...J 

„anscrip.iona.regula.oryse,uencetha.producestranscripU„nofth.re.ayge™d^e^^^^ 
upon^ncUonalreconsUrutionoftheDNA-bindingdomainar-acUvattondon^om^ 
hrbrid proteins. For example and no. limiution. such a relay transcnpt^nal 
sLuen^ can comprise a promo«r and a polynucleotide se,uence compnsmg o« or more 
30 s^nhichbinds)areconsUtdtedfi.nctionaltranscriptIona.ac.iva»rft,rm«lb^ 
::t.ohybridproteins;fore.ample.if,hetw„hybrid.ra„scripUonalactiv«j^^^ 

DNA-binding domair. the relay transcriptional regulatory s«,uence operabl, ..nked to 
relay gene can comprise one or more (ex. binding si» s«,ue.ces. arrayed m and«n. 

T^e invemion also provides polynudeoddes which comprtse a uanscrtpfona. 
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regulatory sequence operably linked to a reporter gene. The reporter gene encodes a protein 
that confers a selectable phenotype on a host cell and/or can be detected by an m vivo assay, 
such as an in situ enzymatic assay (e.g.. host cells expressing a lacZ reporter can be detected 
as blue staining cells in the presence of X-gal). A transcriptional regulatory sequence 
operably linked to the reporter gene comprises a promoter and generally produces constituuve 
transcription of the relay gene contingent upon the substantial absence of the relay protein. In 
an embodiment, the reporter transcriptional regulatory sequence operably linked to the 
reporter gene comprises a binding site for a relay protein, wherein binding of the relay protem 
to the reporter transcriptional regulatory sequence inhibits constitutive transcription of the 
reporter gene. In an alternative embodiment, the reporter transcriptional regulatory sequence 
linked to the reporter gene comprises a binding site for a transcriptional activator protein, 
wherein binding of a constitutive transcriptional activator protein to the reporter transcnptional 
regulatory sequence produces constitutive transcription of the cis-linked reporter gene, and 
wherein the relay protein binds to or otherwise inactivates the transcriptional activator protein, 
thereby repressing constitutive expression of the reporter gene. 

The invention also provides host organisms (typically unicellular organisms) 
which harbor a reverse two-hybrid system, typically in the form of polynucleotides encoding a 
first hybrid protein, a second hybrid protein, a relay gene, and/or a reporter gene, wherem 
said poIynucleotide(s) are either subly replicated or introduced for transient expression. In an 
, embodiment, the host organism is a yeast cell (e.g.. SacchMQIDiCM ^^^^ "^tch the 
germiine GAL80 gene is functionally inaaivated. die relay gene encodes Gal80. and the 
reporter gene transcriptional regulatory sequence comprises a Gal4-responsive promoter. 

nie invention also provides a method for identifying agents that inhibit 
binding of a first interacting polypeptide to a second interacting polypeptide. THe mediod 
5 employs the reverse two-hybrid system described supra, wherein a first hybrid protein 
comprises the first interacting polypeptide and a second hybrid protein comprises a second 
interacting polypeptide. Heterodimerization (or higher order heteromultimerization) between 
the first hybrid protein and the second hybrid protein produces transcription of a relay gene 
encoding a protein which inhibits expression of a reporter protein. Host organisms harbonng 
0 such a reverse two-hybrid system are cultured in the presence of an agent, such as a diffusible 
small molecule (typical MW < 5.000. preferably < 1.000) or a transfected cDNA expression 
polynucleotide encoding a polypeptide agent, and expression of the host organism reporter 
gene is determined and standardized to a parallel blank culture which lacks an agent. Agents 
which produce a . jnificant increase in expression of the reporter gene in a host orgamsm 
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3ft„ a sui<*.e .tae period <e.g., usualW « least 1 .our. often a. ,ea« 3 hours preferably 
about 6 hours, occaisioual.y overnight or longer) are hereby identifled as ,nh,b.tors for 
blocking the ihter^olecular association between *e first an second in.eractu,g polypepude 
scuences. Such protein interaction inhibitors are candidate d.gs for phannaceuncai us. 
5 arl/or .br use as conunercia, research reagents. In an emhodimen, of the —. yeas. 
:;^^rarethehostorganisn,,a,ereportergenee„codes.-ga.actosidaseand/oraprote..,ha, 

complements an auxouophic mutant yeas. bos. cel.. and the firs, and second .nuracung 
polypeptide se,uences each comprise a binding domain derived ftom a s.gnal transductron 

,0 -n-einventionalsoprovidesaUtcomprisingareversetwo-bybridsystem. 
host cel.. and an instruction manual. Such Wts may optionally include a pane, of agents ft,r 
testing. 

nrrftP nP^rRipnoN 

,5 defined „U,erwise.al. technic, and scieoUfie terms us^l herein have 

U,e same meaning as comn^nly understood by one Of ordinary s.cn.ta.hear.»«hid,th., 
invem.onbe.ongs. A..houghany meorods and ma»ri..s slmiiaror e<,u,va.«« » *<^^^^ 
described herein can b. used in the practice or .esfing of the present mventron. the prefer,»i 

« lILsandma.eria.saredescr.bed. For purposes of the presen. invention. *e fo..o«^ 

terms are defined below. , 

to d,e polypepude notation used herein, the .efthand dtrecuon .s d,. amn«. 

direcUon and the Hghthand direction is the carboxy^na. -^"^^ 
with s.a.«.ard«s.ge and conv^ttion. Similarly, unless specified otherw^e, d» .ef.h.^ end of 

^ shtgle-stranded polynucleotide sequences is U.e 5- end; the lefthand ^^^'^^ ^. 
strLed polynucleoUde se,uences is referr«. to as ... 5' dlreoion. T*e — " 
addition olascentRNA transcripts is referred u, as d,etra.scHp.K,n d^o.;s«^^^^ 

regions on Ore DNA strand having the same se,uenc as the RNA and whtch are 5 » *ej 
elf the RNA transcript are referred to as -upstream se<,u«^-; se^-^^^^^^ 
30 DNA strand having dtesatn. sequence as Ute RNA and which are 3- to the 3 endoftheRNA 

transcript are referred to as "downstream sequences" . 

•n-e term ■natura..y.>cc«rring" as used herein as app..«i - object refers to 

d^efacutatanobjeccanbefound innamre. For example. ^^^"'"J^'^ 
sequence tha. is present in an organism CUK-Udingviruseajtha. en beisola^d^m. source 



PCTAJS95/03918 

WO 95/26400 

10 

in nature and which has not been intentionally modified by man in the laboratory is namrally- 
occurring. 

The term "agent" is used herein to denote a chemical compound, a mixture of 
chemical compounds, a biological macromolecule. or an extract made from biological 
5 materials such as bacteria, plants, fungi, or animal (particularly mammalian) cells or tissues. 
Agents are evaluated for potential activity as specific protein interaction inhibitors O-e.. an 
agent which selectively inhibits a binding interaction between two predetermined polypeptides 
but which does not substantially interfere with cell viability) by inclusion in screening assays 
described hereinbelow. 

10 The term "protein interaction inhibitor" is used herein to refer to an agent 

which is identified by one or more screening method(s) of the invention as an agent which 
selectively inhibits protein-protein binding between a first interacting polypeptide and a second 
interacting polypeptide. Some protein interaaion inhibitors may have therapeutic potential as 
drugs for human use and/or may serve as commercial reagents for laboratory research or 
15 bioprocess control. Protein interaction inhibitors which are candidate drugs are then tested 
further for activity in assays which are routinely used to predict suitabUity for use as human 
and veterinary drugs, including ia vivo administration to non-human animals and often 
including administration to human in approved clinical trials. 

As used herein, the term "operably linked" refers to a linkage of 
20 polynucleotide elements in a functional relationship. A nucleic acid is "operably Tmked" when 
it is placed into a functional relationship with another nucleic acid sequence. For instance, a 
promoter or enhancer is operably linked to a coding sequence if it affects the transcription of 
the coding sequence. Operably linked means that the DNA sequences being linked are 
typically contiguous and. where necessary to join two protein coding regions, contiguous and 
25 in reading frame. However, since enhancers generally function when separated from the 
promoter by several kflobases and intronic sequences may be of variable lengths, some 
polynucleotide elements may be operably linked but not contiguous. 

As used herein, the term "endogenous DNA sequence" refers to naturally- 
30 occurring polynucleotide sequences contained in a eukaryotic or prokaryotic cell. Such 

sequences include, for example, chromosomal sequences (e.g.. structural genes, promoters, 
enhancers, recombinatorial hotspots. repeat sequences, integrated proviral sequences). A 
-predetermined sequence" is a sequence which is selected at the discretion of the practitioner 
on tiie basis of known or predicted sequence information. An exogenous polynucleotide is a 
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polynucleotide which is transferred into a eukaryotic or prokaryotic cell. 

As used herein the term "physiological conditions" refers to temperature, pH, 
ionic strength, viscosity, and like biochemical parameters which are compatible with a viable 
organism, and/or which typically exist intracellularly in a viable culmred yeast cell or 

5 mammalian cell. For example, the intracellular conditions in a yeast cell grown under typical 
laboratory culture conditions are physiological conditions. Suiuble in Yitro reaction conditions 
for in vitro transcription cocktails are generally physiological conditions. In general, in vitrg 
physiological conditions comprise 50-200 mM NaCl or KCl. pH 6.5-8.5. 20^5-C and 0.01- 
10 mM divalent cation (e.g.. Mg-. Ca-); preferably about 150 mM NaCl or KCl. pH 7.2- 

10 7.6, 5 mM divalent cation, and often include 0.01-1.0 percent nonspecific protein (e.g.. 
BSA). Particular aqueous conditions may be selected by the practitioner according to 
conventional methods. For general guidance, the following buffered aqueous conditions may 
be applicable:- 10-250 mM NaQ. 5-50 mM Tris HQ. pH 5-8. with optional addition of 
divalent cation(s) and/or metal chelators and/or nonionic detergents and/or membrane 

15 fractions. 

The terms "functional disruption" or "functionally disrupted" as used herem 
means that a gene locus comprises at least one mutation or structural alteration such Aat the 
funrtionally disrupted gene is substantially incapable of directing the efficient expression of 
functional gene product. 

20 As used herein, the terms "interacting polypeptide segment" and "interacting 

polypeptide sequence" refer to a portion of a hybrid protein which can form a specific binding 
interaction with a portion of a second hybrid protein under suitable binding conditions. 
Generally, a portion of the first hybrid protein preferentially binds to a portion of the second 
hybrid protein forming a heterodimer or higher order heteromultimer comprising the first and 

25 second hybrid proteins; the binding portions of each hybrid protein are termed mteracting 

polypq>tide segments. 

Oescription of the Invention 

Generally, the nomenclature used hereafter and the laboratory procedures in 
30 cell culture, molecular genetics, and nucleic acid chemistry and cell culture described below 
are those well known and commonly employed in the art. Standard techniques are used for 
recombinant nucleic acid methods, polynucleotide synthesis, and microbial culture and 
transformation (e.g.. electroporation. lipofection). Generally enzymatic reactions and 
purification steps are performed according to the manufacturer's specifications. The 
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techniques and procedures are generally performed according to conventional methods in the 
an and various general references (see, generally. Sambrook et al. Molecular Cloning: A 
Laboratory Manual. 2d ed. (1989) Cold Spring Harbor Laboratory Press. Cold Spring 
Harbor. N.Y.. which is incorporated herein by reference) which are provided throughout this 
5 document. The procedures therein are believed to be well known in the art and are provided 
for the convenience of the reader. All the information contained therein is incorporated herein 
by reference. 

The reverse two-hybrid method is generally applicable for identifying agents 
which inhibit binding between a variety of predetermined interacting polypeptides. 

10 

Overview 

A basis of the present invention is a strategy for screening a bank of agents 
with a reverse two-hybrid system to identify agents which inhibit the intermolecular 
association of two interacting polypeptide sequences. Thus, in a reverse two-hybrid system 
15 there is at least one pair of interacting polypeptide sequences, with a first interacting 

polypeptide sequence present in one of the hybrid protein species and a second interacting 
polypeptide sequence present in the other hybrid protein species. The choice of interacting 
polypeptide sequences incorporated in a reverse two-hybrid system is selected at the discretion 
of the practiUoner. For example, a reverse two-hybrid system suitable for identifying agents 
20 which inhibit Fos/Jun leucine zipper formation may be composed of a first hybrid protein 
having an interacting polypeptide sequence comprising a Fos leucine zipper and a second 
hybrid protein having an interacting polypeptide sequence comprising a Jun leucine zipper. A 
variety of interacting protein sequences can be used; for example and not limitation, these 
include: transcription factor binding domains, multisubunit proteins, signal transduction 
25 proteins (G proteins, members of ras/raf/MEK signaling pathway(s), tumor suppressor protein 
binding domains (/». p5S), and the like), polypeptide ligands and their cognate receptor(s), 
active sites of enzymes which catalyze reactions involving binding to a polypeptide substrate 
and the polypeptide substrate itself, and essentially any pair of protein sequences which form 
an intermolecular association under physiological conditions. Generally, interacting 
30 polypeptides form heterodimers with a dissociation constant (Ko) of at least about 1 x itf M ', 
usually at least 1 x 10* M". typically at least 1 x 10» M-. preferably at least 1 x lO* M ' to 1 
X 10' M-* or more. under suitable physiological conditions. 

Reverse two-hybrid systems are used for detecting the ability of agents to 
inhibit the intermolecular binding of two interacting polypeptides and provide for facUe hi^- 
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throughput screening of agent banks (e.g.. compound libraries, peptide expression libraries, 
and the like) to identify protein interaction inhibitors which preferentially inhibit 
intermolecular binding between two predetermined interacting polypeptide species. Such 
protein interaction inhibitors (specific binding antagonists) can modulate biochemical acUvrty 
5 of the predetermined interacting specie(s) and thereby modulate biological function. Agents 
which alter the intermolecular association of the two interacting polypeptide sequences m the 
hybrid proteins, generally by inhibiting heterodimeric binding of the two hybrid protems. 
score positively in the reverse two-hybrid system. The protein interaaion inhibitors thereby 
identified are candidate drugs for human and veterinary therapeutic use and/or are su.uble 
10 commercial reagents for laboratory research or bioprocess control. 

An agent capable of specifically inhibiting protein-protein binding of a 
therapeutically relevant protein interaction in vivo can be used for therapy of disease or for 
modulation of gene expression in cells and organisms. Typically, an efficacious dose of a 
protem interaaion inhibitor is administered to a patient as a therapeutic or prophylactic for 
15 treating a pathological condition (e.g.. cancer, inflammation, lymphoproliferative diseases, 
autoimmune disease, and the like). 

r>f>«!rri ption of the Preferr* ^ Fmbodiment 

In order to Uiustrate the invention, a description of a preferred embodiment b 

20 cresented below. TTiis embodiment comprises a reverse two-hybrid system in yeast cells that 
'are functionally disrupted for endogenous GAl^ expression, wherein the intermolecular 
association of the first and second hybrid proteins activates transcription of a GAI^ relay 
gene. Expression of GAL80 represses the transcriptional activity of constitutively expressed 
Gal4 protein and inhibits transcription of a Gal4-dependent reporter gene. 

25 A variety of alternative embodiments and variations will be apparent to those 

of skill in the art. including alternative relay genes, alternative host cells (e.g.. mammalian, 
bacterial, fungal, insect, and tiie like), variations of the basic reverse two-hybrid method, and 
others. 

T«,r>-HYhriH systems 

Transcriptional activators are proteins that positively regulate the expression of 
specific genes. TTiey can be functionally dissected into two structural domains: one region 
that binds to specific DNA sequences and thereby confers specificity, and another region 
termed the activation domain that binds to protein components of the basal gene expression 
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machinery (Ma and Ptashne (1988) CeU 55: 443). These two domains need to be physically 
connected in order to function as a uanscriptional activator. Two-hybrid systems exploit this 
finding by hooking up an isolated DNA binding domain to one protein (protein X), whUe 
hooking up the isolated activaUon domain to another protein (protein Y). When X and Y 
5 interact to a significant extent, the DNA binding and activation domains wUl now be 
connected and the transcriptional activator function reconstituted (Fields and Song (1989) 
Nature 34Q: 245). The yeast host strain is engineered so that the reconstimted transcriptional 
activator drives the expression of a specific reporter gene such as HIS3 or lacZ, which 
provides the read-out for the protein-protein interaction (Field and Song (1989) op^; Chein 
10 et al (1991) Q£^. One advantage of two-hybrid systems for monitoring protem-protem 
interactions is their sensitivity in detection of physically weak, but physiologically important, 
protein-protein interactions. As such it offers a significant advantage over other methods for 
detecting protein-protein interactions (e.g.. ELISA assay). Unlike the ELISA assay, 
however, the current two-hybrid system is not readily transplantable to drug screemng 
15 operations. A major problem with the existing two-hybrid methods is that nonspecific 
inhibitors of transcriptional activation score the same as inhibitors of the specific protem- 
protein interaction. 

Mp p^fivft Regulaf "rf »f Transcription 
To address the aforementioned problem, the readout of the conventional two- 
20 hybrid interaction can be reversed by interposition of a relay gene which serves to invert die 
output produced from interaction of the two hybrid proteins from a positive transcriptional 
activator to a negative transcriptional regulator (e.g.. repressor). In order to mvert die 
readout from a positive transcription activator to a negative transcription repressor, it » 
possible to take advantage of the properties of certain negative regulators of transcription. In 
25 an embodmient, some of these negative regulators block the function of specific transcripuonal 
activators by binding to their aaivation domain. Two such examples are the mdm2 
oncoprotein which binds to and masks the trans-activation domain of the tumor suppressor 
proteinpii (Momand et al. (1993) C^l 62: 1237; Oliner et al. (1993) Ilamre 2ii2: 857) and 
the yeast GaI80 protein which binds and inactivates the transcriptional activator region of Gal4 
30 (Ma and Ptashne (1987) Cdl 5Q: 137; Johnston and Carlson (1993) Fe puhtion of Carbon ^ 
PWnH.t. Metabolism, vol. 2. Cold Spring Harbor Press. Cold Spring Harbor. New York). 
By designing the two-hybrid interaction to drive the expression of a negative regulator of a 
specific transcriptional activator, the resultant system is such that interference with the two- 
hybrid interaction results in increased activity of a transcriptional activator and hence a 
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pcsWvesign^. ,„ view of *e fachat *e biology of *e G3l8(VGal4 sys«m is «eH 
!„den.cod in yeas,, *is pair o, „ega,ive-reg.,ators/«anscriptio„al aaiva.o,s „ su.uble for U,e 
reverse .wo-hybrid me*od. In principle. *e pair of ^ and pS3 proteins, or any o*er 
matched pair of transcriptional activator and specific negative reg„.a«>r. will also work. 
5 In the present embodiment, the .wo interacting hybrid proteins reconstitute a 

^anscriptional acUvator composed of a DNA binding domain derived ftom 
protein encoded by ^ and an activator domain derived from the vtral protetn VP,6 (Vo,tek 
et al. (1993) oi^. T^' reconstinited /e^/VW« transcriptional activator binds to a relay 
gene operably linked to a transcriptional regulatory 
,0 sequence containing tandem copies of a toO binding site sequence which binds the ier. 
DNA-binding domain. Upon binding of die reconstituted leMfVP16 transcriptional acnvator 
.0 the («0 binding site(s), the operably linked relay gene (GAW is efRcienUy exp,ess»i. 
•n,us when the nvo hybrid proteins are associated (e.g.. as a heterodimer or the hke). the 
GA^ relay gen. is expressed and serves » repress expression of a reporter gene constnrC. 

15 

Th- 'ri-l't'K''''* K^nam 
He Gal80^al4 system of regulatory proteins underiies the ability of yeast 
cells to respond to exogenously added galactose and specifically synthesize the enxyn^s 
needed to utilize it as . carbon/energy source (Johns«,n and Carlson (1993) Unless 
20 galactose is present the Gal80 protein binds and blocks the — of the transcripoonal 
activator Oal4. In th. absence of the GALSO gene, the tran^iptional acUvator funcuon of 
Gal4 is no. masked and bene, expression of galactos^«gula«d genes no longer requires 
galacws. for inducUon. U. .he r«ers. two-hybrid system, ft. bos. strain generally » 
LncUonally disrupt, fcr ft. endogenous GMJiO ge^. bu. GalSO pro»in U provided ftrou^ 
25 a two-hybrid driv«. r.Uy gen. construe. <^ Experim«>ui Example, l-l/h.) which .s op«ably 
,ink«l «. a ttanscripdonal regulatory sequence fta. binds a baaerial lexA DNA-btndmg 
domain presen. to ft. firs, hybrid pro«in. When fte two-hybrid in^action is drivmg ft. 
expression of fte relay gene product. GalgO. fte Gal4.i«.ucUon of ft. reporter gene(s) ts 
inhibited, men ft. two-hybrid interaction is blocked, fte relay gen. (C^ „n wUl 
30 be mmed off and ft. Gal4-dep.ndem uanscriprional regula»ry s«,u««. op«ably l>nk«i to 
fte reporttr gene(s) is fteo able to drive expression of fte reporttr ga«<s). 

T..Kp T„.< for Fin - -^IT Tf ""r^sion of ft. Relay (jgi e 
Tre sensitiviv of ftis system can be modulated by »ii«s.mg ft. amount of 
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Gal4 or Gal80 protein. A host strain generally contains the wild-type GAL4 gene and hence 
contains very low levels of Gal4 when the yeast cells are cultured with carbon/energy sources 
such as raffmose (Johnston and Carlson (1993) OExiL). If necessary, the level of Gal4 
protein can be decreased by at least five-fold by culturing the cells in glucose (Griggs and 
5 Johnston (1991) °- ^^^^ ^"^^ ^SA^ M'- 8597). Higher levels of Gal4 protein can 
be provided by transforming the strain with a multicopy plasmid encoding Gal4 (Schultz et al. 
(1987). 

The amount and/or stability of the relay protein. Gal80. can also be adjusted. 
Preferably, the stability of the Gal80 protein is sufficient such that the addition of protein 

10 interaction inhibitor agents generates a detectable readout of the reporter gene(s) within about 
six hours, or most usually within the time-frame of an overnight assay. For this to be a 
convenient assay approach. Gal80 activity preferably deteriorates at a rapid rate when active 
inhibitor agents are added and the two-hybrid system is inhibited. The half-life of Gal80 
proteins in yeast cells has not been rigorously defined in the art. If Gal80 has a short half- 

15 life, it is generally only necessary to vary the level of transcription of GAL80 by changing 
either copy number of the two-hybrid relay gene construct or by varying the number of 
binding sites for the transcriptional activator (e.g., lexO operator sequences) in the 
transcriptional regulatory sequence of the relay gene construct. If Gal80 has an inordinately 
long half-life, it is preferable to engineer a chimeric Gal80 protein with a shorter half-life. 

20 Successftil engineering of long-lived proteins to proteins with shorter half-lives has been 
achieved by addition of PEST sequences to DHFR (Loetscher et al. (1991) J. Bigl. Chgm, 
266:11213) or by forming /S-galactosidase variants with different N-terminal residues by ifl 
vivo processing of ubiquitin-/S-galactosidase fusions (Varshavsky et al. (1989) Qen^p 
Fngineering . Bart, Brake, and Valenzuela (eds.), Butterworths, pp. 109-143). The latter 

25 method has been well characterized in yeast, such that GalSO variants with half-lives ranging 
from 2 minutes to over 24 hours can be readily generated. 

The following examples are offered by way of example and not by way of 

limitation. 

FYPERIMEN TAT FYAMPLES 
30 rnnstruction of the appro priate host veast strains 

Since the GAL80-GAL4 system is employed, the reporter genes in the yeast 
strain need to be operably linked to promoters that are responsive to Gal4. Reporter genes 
that have been operably linked to Gal4-responsive promoters were integrated into yeast strains 
(see Construction of Yeast Strains, infra). One of the reporter genes encodes ^-galactosidase. 
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„bo« expression allows quantiutive „anscriptional readout, if desired. It is possible to 
utilize other reporter genes operably linked to Gal4-responsive promoters, such as o.« 
encoding alkaline phosphatase, that would also allow easy quantitation of transcriptional read- 
out JEY8 JEYIO. and JEY12. three independent progenitors for the reverse-two hybnd host 
5 sualns. we^e derived by standard genetic memods from a cross between YM2170 ^MATa 
US3 adc2 iy,2 t,rl GAIA- galSO^ lEU2:OALl-lacZ: available from Dr. Mark Jobnsu,n. 
Washington University. SL Uuis. MO) and YPB2 (MATa MsS a^2 Uu2 ura3 tys2 ,rpl can, 
,al4A iolSOl. LYS2::aALl-mS3 URA3::CALl.lac2i (Bartel e. al. (1993) in 

;„r..v>.on.en.- A Praoric al approach. Hartley DA (ed.) Oxford UnlverslQ, 
10 Press, Oxford. UK. pp. 153). The progenitor strains iMATa «rf <^2 Uu2 uraS lys2 np, 
GALT gatSOA LrS2.:GALI-HlS3 VRA3.:aALl-laa, contain all the necessary reporter genes 
and have been tested for a funcdonal Ga)4 protein and reporter genes by analysis of galactose- 
induced expression of fl-galac«>sidase. To test it. general whether re-introdnction of GdSO 
protein negatively regulates Gal4 in the system. JEY8 was transfcrmed with a higb^ 
,5 plasmid containing the wild-type CM^ gene (pBM260; avaUabl. from Mark iohnston 
Washington University. Louis. MO). Suffcien. expression results in inhibtaon of the 
readouts from the reporter genes ifUSS and tocZ). which are determined by assaymg B- 
galactosidase aaivity and growth in the absence of hisUdine. Both of *ese reporter acuv.«s 
are scored in yeast grown on plates containing raJBnos. (which allows fer fiJl «:dv,.y of 
20 OalSO protein) and galactose (which inactivates CalSO protein). Tl.es. tests confirm ^ u> 
ftese strains d.e GalSO protein, expressed off its endogenous promoter, suppresses Gal4 
function. Two*,brid constructs are evaluated for their abUity to drive sufficient GMIt) 
expression from the LextMMUO fusion plasmids that are construcBd (jts. »!#«.). 

25 r'l T"""'"'"" °* I*" ' fa"''" ^ , 

A chimeric gene (a UxO-GALSO iusion) is constructed and serves as .he relay 

(Signal mverter) gene, "me DNA-binding domain of the transcriptional .«iva.or that is us«i 

to drive expression of U.. relay gene is derived from the bacterial protein encoded by /ex. and 

has been used before in two-hybrid systems as a fusion with the transcriptional activator ftom 

30 the viral protein VP.6 (Vojtek e. al. (1993) Odter tianscripdonal activator ^ ave 

a defined DNA binding site, such as the ACE, gene product of S. cere^^ (Munder and 

Furst (.992) MsLCdUid. 12: 2091) may be used. The l^O sites are getteraud by 

munrally primed synthesis (5«. Chapter 8.2A in r p,nt.. oh in Mol^.lar BloHHiV 

(1990) Ausubel. Brent. Kingston, Moore. Seidman. Smith, and Struhl (eds.). Greene 
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Publishing Associates and Wiley Interscience. New York, NY) using the oligomer 
GCGAATrCCrACrGTATATACATACAGTACCATCrACTGTATATACATACAGTAGC 
CGCrCGAGCGGC-3' [SEQ. ID NO:!]. TT^e resulting fragment contains four consensus 
5 LexA binding sites in tandem. The DNA product is digested with EcoRI and inserted into the 
EcoRI site of pCZD (Lue et al. (1989) Proc N.rl Ar.d Sci. (USA) M: 486) to generate 
pCZD-LexO The pCZD vector contains a minimal TATA box for recognition of the basal 
transcriptional machinery but requires the addition of specific DNA sequences to effeaively 
function as promoter box. TT.e Gal80 coding sequence is isolated by PGR using the foUowmg 
10 two oligomers: S'-CGCGGATCCCGTrCTTrCCAC TCCCG-3' [SEQ. ID NO:2]; and 

5'-CGGATCCGATGGAAGGATGCCCGCrGCrGC-3' [SEQ. ID N0:3]. THe template is the 
plasmid PBM260 which contains the GAUO gene subcloned in YEpl3 (available from Mark 
Johnston. Washington university, St. Louis. MO), 'n.e PGR produa is digested with 

BamHI and inserted into the BamHI site of pCZD-LexO to create pLexO-Gal80. IHe LexO- 
15 Gal80 fusion is then subcloned mto pGalUeo, a 2m based yeast shutUe vector (20-30 cop.es per 
cell) carrying the ADE2 selectable marker (available from Avtar Roopra. Washmgton 
University. St. Louis. MO) to generate pJE42. From this plasmid. CEN- and integrating 
versions are constructed to provide a means of controlling the level of expression of CAUO 
by the two-hybrid interaction. For example, the basal transcription from the 2m plasmid may 
20 express sufficient Gal80 to require galactose for expression of the reporter gene(s) eve^ m 
absence of a lexA-based transcriptional activator. Additionally. LexO-ubiquitin-GaI80 fusions 
encoding a shortened half-life Gal80 protein is constructed. 

In order to demonstrate that relay gene constructs comprising a LexO-GAL80 
polynucleotide fusion can be activated by the two-hybrid interaction to sufficient levels for 
25 regulating the GalA-mediated reporter gene expression, a positive control is generated. Yeast 
are transformed with a plasmid that contains a fusion of the DNA binding domain GexA) and 
transcriptional activation component (VP16) of the two-hybrid system and activates 
transcription of the relay gene Le.0-GAL80 Vision. Hie plasmid pLEX-VP16 (avaUable from 
A. Vojtek; Vojtek et al. (1993) o^^ U used for the positive control. The ability of two- 
30 hybrid interactions to drive expression of the relay gene is demonstrated. 

Tr -tinr fnr tb^ ---"^ Y t^>^-^vHrid inter.rtions to drive ex pr.>-Hnn o f the 1 exO-GALgQ 

fusions . ■ . 

m two-hybrid mteraaion that is used to test for its abUity to drive sufficient 
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expression of the LexO-CALSO relay gene is the interaction of human H-ras p21 with human 
c-Raf (Van Aeist et al. (1993) Pror N.tl Ar.d Sci. (USA) 2Q: 6213; Vojtek et al. (1993) 
op.cit.) K-ras is linked by in-frame polynucleotide fusion to the VP16 activation domain, and 
Raf is linked by in-frame polynucleotide fusion to the DNA binding domain of the lexA gene 
5 produa pGBT8-Raf was constructed by ligating EcoRI and PstI linkers to a ;?a/codmg 
sequence isolated by PGR amplification of a human placental cDNA library from Stratagene 
(San Diego, CA) as described by MacDonald et al. (1993) Mol , Cell. Biol, H: 6615. THe 
Raf gene was cut out of pGBT8-Raf as an EcoRI to PstI fragment and subcloned into the 
EcoRI-PstI site of pBTMl 16 (that contains the LexA DNA binding domain (Vojtek et al. 
10 (1993) OE^ to generate pBTM-Raf (pJE36). The EcoRI site maintains the same readmg 
frame pGBT8K-ras was constructed by PGR amplification of pEXV-K-ras (Hancock et al. 
(1990) £en 61: 133) such that the K-ras sequence is isolated as a Sall-PstI restriction fragment 
which was then subcloned into Sall-PsU-cut pGBTS. To construct pVPK-ras (pJE44), a PGR 
product of pGBT8K-ras was generated using the following oligomers as PGR amphmers: 
15 5'-CGGGATCGATGAGTGAATATAAACrTGTGGTAG-3- [SEQ. ID NO:41 

5--CGGGATCCrrACATAATrACACACnTGTCrrrCACrrG-3- [SEQ. ID NO:5] 
and the resultant PGR product was digested with BamHI and subcloned into the BamHI site of 
PVP16 (Vojtek et al. (1993) ^ to generate pVPK-ras (pJE44). Hie UxO-GAI^ relay 
gene plasmid. the pBTM-Raf and the pVP-K-ras (pJE44) plasmids are cotransfected into a 
20 host yeast strain and the ability of the two-hybrid interaction to drive sufficient expression of 
CAL80 to prevent the expression of the reporter genes (tocZ and HIST, is dete.u.ined. 
Growth on galactose is used as an internal positive control to ensure that the promoter rs stdl 
functional. 

H,e reverse two-hybrid method is used as . screening assay for rfenufying 
small nK.iecule inhibitors of protein-protein interaction, such that an exog«K,usly add«i sn«ll 
molecule can interfere with a two-hybrid interaction. In one example, a reverse mo^.ybnd 
system utUizes the small molecule estradiol as the protein interaction inhibitor. Estradiol B a 
30 small lipophUic molecule that has been shown to be effecdve in yeast. I. has been shown .h« 
estnKiiol reverses the interaction of the hormone binding domai.. of the es.rog«. receptor w..h 
me heat-shoclc protein HSP90. Thus. . first hybrid protei., comprUing .he hormone bindmg 
domain of the estrogen recep«>r in polypeptide linkage to a le^tA DNA-binding domain and a 
second hybrid protein comprising the beat shock protein, HSm in polypeptide Imkage to the 



PCrAJS95/03918 

WO 95/26400 

20 

VP,6 ^ivadon don^ are consuuced by standard .neU-ods. Polynuc.ec.ide(s)encodU,g *e 
firs, and second hybrid proteins, a UxC^OAIW reiay gene comuuct. and a Ga.«epende« 
reporter gene comtruc. are introduced into the yeast host. Estrogen (e.g., estradtol) « 
evaluated as an agent for inhibiting fonnation of a ft.nc.ional two-hybrid heterodimer and 
5 thereby producing expression of the reporter gene. 

.K. ,.n„v of n noh^^ti de tn im^rfer P n. jth , t..o-|,vhrid imeracm- 

The reverse two-hybrid method is used as a screening assay for tdenttfymg 
polypeptide inhibitors of protein-protein interaction, such that an intracellularly expressed 
,0 pol^eptide can interfere with a two-hybrid interaction, one entbodinaen, a reverse two- 
hybrid system uU.i«s a polypeptide expressed ftom a cotransfected cDNA expressto. 
constnia as the protein interaction inhibitor. 

A first hybrid protein comprising a first interaaing polypept.de sequence m 
polypeptide linkage .0 a lexA DNA-btading domain and a second hybrid protein cotnprising a 
,5 econd interacting po.ypepUde sequence in polypepUdelinlcage to the W.6ac«vat.^ 

are constructed by standard methods. Po.ynucleotide(s) encoding the first »Ki second hybrn. 
pr«eins. a -lay gene construe and a GalWependen. reporter gen. cons«^ 

are introduced into ^.e yeas. host. A po.ynucleo.ide encoding «>d expressing a polypepode 
typically bemeen 5 and 500 amino acids long (e.g.. a library member of a cDNA express-on 
20 library) is also in,roduc«l into the yeas, cells under condWons wherein the encoded 

polypeptide is expressed intracellularly. TTe expressed polypeptide U evalua^l as an ag«.. 
for inhibiting fonnation of a fitnaional two-hybrid heurodhner and dtereby producu^ 
expression of Uie reporter gene. .u. .„ ™„i>, 

Essentially any of various expression clone libranes known m die ar. may be 
25 used, including commercially available expression libraries (aonuch. Inc.. Palo A1» CA). 
Expression clone libraries may also be generated by the practitioner by conventional domng 
medtods and vectors known in the art (e.g.. pcD. pSV). especially yeas, express.- vec«=rs. 
Expression clone libraries comprise a collection of library members, each member comprtsmg 
a Cloned polynucleotide sequence (which may comprise muta.ion(s) or deletions), .^.cally a 
30 CDNA sequence, operably linked to a promoter (and optionally an enhancer) whtch « 

,ranscriplal,yactive-mthehos.ce.lso,ha.titeclonedseq„enceUtranscrlbedandtiansia^ 

into a polypeptide. Genomic DNA sequences (e.g.. complete strucmral genes or ftagme^ 
thereoO may also serve as cloned sequences in expression libraries. Preferably, the don^ 
sequence is inserud in cloning site which facUitates d,e recovery of tite cloned sequence 
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from the promoter and other sequences which comprise an expression cassette. 

Expression clone library members are transferred into host cells by various 
means, including but not limited to: electroporation, lipofection. viral vector transduction, 
biolistics, and CaPO. precipitation. Expression clone library members may be transferred 
5 directly into host cells, or a relay and/or reporter polynucleotide and/or polynucleotide(s) 
encoding the first and second hybrid proteins may be co-transferred with expression clone 
library members into a host cell, or a relay and/or reporter polynucleotide and/or 
poIynucleotideCs) encoding the first and second hybrid proteins may be transferred into host 
cells subsequent to transfer of expression clone library members. 
10 Cloned polynucleotides can be recovered from expression clone library 

members which are isolated by the screening methods of the invention. Typically, cloned 
sequences are excised by restriction digestion with an enzyme(s) which cleave at the 
boundaries between the ends of the cloned sequence (e.g., cDNA) and the remainder of the 
expression clone library member. Alternatively. PGR (preferably high-fidelity PGR) or other 
15 amplification method (e.g., LCR) may be performed using primers which flank the site at 

which the cloned sequence is inserted in the library member to amplify and thereby isolate the 
cloned sequence (U.S. Patent 4.683,202. incorporated herein by reference). When PGR is 
used it is generally preferable to incorporate known unique polynucleotide sequences flanking 
at least one. and preferably both. side(s) of the site in which a cloned sequence is inserted to 
20 facilitate recovery of the selected cloned sequence(s). 

Although the present invemion has been described in some detaU by way of 
illustration for purposes of clarity of understanding, it wQl be apparent that certam changes 
and modifications may be practiced withm the scope of the claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: ERICKSON. JAMES R. 

POWERS. SCOTT 

(ii) TITLE OF INVENTION: REVERSE TWO-HYBRID METHOD 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooley Godward Castro Huddleson & Tatum 

(B) STREET: Five Palo Alto Square, Fourth Floor 
(Q CITY: Palo Alto 

(D) STATE: California 

(E) COUNTRY: US 

(F) ZIP: 94306-2155 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
(Q OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: Patentin Release #1.0, Version #1.25 



(vi) CURRENT APPUCATION DATA: 

(A) APPLICATION NUMBER: US 08/218.933 

(B) HLING DATE: 29-MAR-1994 
(Q CLASSIHCATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Torchia, Timothy E. 

(B) REGISTRATION NUMBER: 36.700 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 843-5481 

(B) TELEFAX: (415) 857-0663 

5 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base paiis 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1..68 

(D) OTHER INFORMATION: /standard_nanie= "PCR primer- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCC^TTCCT .CTO™ CT^C C.TCT.CTOT .T^T.CT^ .CT.CCCOCT 

30 

CGAGCGGC 



35 
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(2) INFORMATION FOR SEQ ID NO:2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 
5 (B) TYPE: nucleic acid 

(Q STRANDED NESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

15 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /standard_name= "PGR prinaer" 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
CGCGGATCCC GTTCTTTCCA CTCCCG 

25 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
30 (B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



wo 95/26400 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

5 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1.. 30 

(D) OTHER INFORMATION: /standard_naine= "PCR primer" 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
CGGATCCGAT GGAAGGATGC CCGCTGCTGC 

15 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 
20 (B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

30 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1..33 

(D) OTHER INFORMATION: /standard_name= "oligomer ft 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CGGGATCCAT GACTGAATAT AAACTTGTGG TAG 

5 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 
10 (B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

20 

(ix) FEATURE: 

(A) NAME/KEY: misc_feattire 

(B) LOCATION: 1..39 

(D) OTHER INFORMATION: /standard_name= -oligomer 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



CGGGATCCTT ACATAATTAC ACACTTTGTC TTTCACTTG 

30 
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fTAlMS: 

1 A reverse two-hybrid system comprising: 

(1) a first hybrid protein comprising a first interacting polypeptide sequence in 
polypeptide linkage to a DNA-binding domain of a transcriptional activator; 

(2) a second hybrid protein comprising a second interacting polypepnde 
sequence in polypeptide linkage to an activation dor^n of a transcriptional activator, wherein 
the second hybrid protein binds to the first hybrid protein via contact of the interacting 

^ polynucleotide sequences under physiological conditions; 

(3) a relay gene whose transcription is dependent upon the first hybrid protem 
and the second hybrid protein being bound to each other, thereby reconstituting a 

transcriptional activator; and , , 

(4) a reporter gene whose transcription is repressed by expression of the relay 
gene and which is substantially transcribed in the absence of relay gene expression. 

2. A reverse two-hybrid system of claim 1. wherein the two-hybrid 
system is in a yeast host cell. 

J A r««ise two-hybrid system of claim I . wheiein the first hybrid 
orctem. the seco.^ hybrid protein, a relay protein, a reporter protein are encoded on a. 
ieas. on. polynucleotide and said at leas, one polynucleotide is inUoducol into a yeas. nos. 
ceU. 

4 A reverse two-hybrid sysKmofclaim I. wherein <he fits. 

25 hybridpr«..incompris.s.lexADNA-bindi„gdomaininpolypepad.linlcag.<ofl»firs. 

interacting polypeptide sequence; 

the second hybrid protein comprises a VP16 activation domam m polypeptide 

linkage to the second interacting polypeptide sequence; 

die relay gene encodes Gal80 operably linked to a LexO sequence m a cu.- 
30 linked relay gene transcription regulatory sequence; and 

the reporter gene comprises lacZ or HISS and is operably linked to a 

transcription regulatory sequence which confers GalA^ependent transcription to cis-Unked 

adjacent polynucleotide sequences. 
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5. A reverse two-hybrid system of claim 4 in a yeast cell produced by 
crossing a SMchargmyces organism having the genotype MATa his3 ade2 leu2 ura2 lys2 trpl 
CAU"" galSOD LYS2::CAL1-HIS3 URA3::GALl-ladZ. 

5 6. A reverse two-hybrid system of claim 4 further comprising an agent. 

7. A reverse two-hybrid system of claim 4 further comprises an 
expression clone library member which expresses an intracellular polypeptide in the yeast 



host 

10 
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8. A yeast cell comprising a reverse two-hybrid system of claim 1 or 4. 

9. A polynucleotide encoding a GalSO polypeptide and comprising at least 
one operably linked LexO binding site. 

10. A polynucleotide of claim 9 in a yeast ceU which contains a 
functionally disrupted endogenous GALSO gene. 

11. A yeast cell containing: 

20 a polynucleotide sequence encoding a first hybrid protein whJch is 

constitutively expressed; 

a polynucleotide sequence encoding a second hybrid protein which is 

constitutively expressed; 

a polynucleotide sequence encoding a relay protein whose expression is 

25 dependent upon intermolecular binding of the first hybrid protein with the second hybrid 
protein under physiological conditions; and 

a polynucleotide sequence encoding a reporter protein whose expression is 

repressed by the relay protein. 

30 12. A yeast cell of claim 1 1 , further comprising an expression clone 

library member which expresses a polypeptide encoded by a cDNA. 



13. A yeast cell of cUim 11. further comprising an agent having a 
molecular weight of less than 1,000 daltons. 
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14. A method for identifying agents which inhibit intermolecular binding 
under physiological conditions between a first interacting polypeptide sequence and a second 
interacting polypeptide sequence, said method comprising the steps of: 

administering an agent to a host cell containing a reverse two-hybrid system of 
5 claim 1 or 4 and incubating the host cell for a suitable period; 

determining whether the administration of the agent induces a substantially 
expression of the reporter gene; and 

identifying an agent which induces a substantially expression of the reporter 
gene as a protein interaction Inhibitor. 

10 

15. A method of claim 14, wherein the agent is a molecule having a 
molecular weight less than about 1,000 daltons. 

16. A kit con^irising a reverse two-hybrid system, a host cell, an 
IS instruction manual, and optionally a panel of agents for testing. 
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17. A reverse two-hybrid system of claim 4, wherein the first interacting 
polypeptide sequence is a mammalian ras polypeptide aiul the second interacting polypeptide 
sequence is a Raf polypeptide. 



INTERNATIONAL SEARCH REPORT 



Intemaiioiul application No. 
PCT/US93/03918 



. CLASSinCATION OF SUBJECT MATTER 
IPC(6) :C12N 15/00. 15/06. 15/11. 15/12. 15/81 
US CL : 435/7.1. 240.1, 254.21: 536/23.1, 23.2 
According to IntemaUonal Patent Classification (IPC) or to both national classification and IPC 



HELDS SEARCHED 



Minimum documentaUon searched (classincation system followed by elassincation symbols) 
U.S. : 435/7.1, 69.1. 69.9. 172.3. 240.2, 243. 320.1; 935/37. 43254.21; 536/23.1. 23.2 



Documentation searched other than minimum documentation to the extent thai such documents are included in the fields searched 



Electronic data base consulted during the intenwUonal search (name of data base and, where practicable, search terms used) 
Please See Extra Sheet. 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Cell, Volume 74, issued 16 July 1993, A.B. Vojtek et al, 
"Mammalian Ras Interacts directly with the serine/threonine 
kinase Raf", pages 205-214, see entire document. 

Proceedings of the National Academy of Sciences, USA, 
Volume 88, issued December 1991, H.A. Vasavada et al, "A 
contingent replication assay for the detection of protein- 
protein interactions in animal cells", pages 10686-10690, 
see entire document. 

Nature, Volume 340, issued 20 July 1989, Fields et al, "A 
novel genetic system to detect protein-protein interactions", 
pages 245-246, see entire document. 



1-17 



( ) Further documcntt axe listed in the continuation of Box C. Q See patent family annex. 


SpecaloutMuaorcaedil an da,,^ Bo«ipeonllie»wia.«h« w«ic«««»>^««««» »««^«»»^ 

*A* il<iciBiadd«<faimtlfc*|cMnl«Meof1teutwhacki»MicoiiMdc)«d principla or Oicmy undcHyinc die iavcabea 

*E* cariierdaeumea«piiblahedaaoriAcrtbaiBtereMoaainbii«d«i ctxuidcTtd twvcl or canaot Im oooaUcnri to involve «a iovaitive 
•L* doannei*wUehm»y lhf«w4oublioBpnoiily etaiiB<i)of whk* • wfcea *o doeuraem ■ t«ke« iloo* 

ciud lo oBbiah Ibe publnnioo d«lB of tDnOta atntkai or otbcr doeuracal of p.ttieuhf Rlcvwier. iho eteimed invcntioa cnoot ho 
qaedol ream (•• tfcalkOt coo«aered lo mvolve mt invouivo Mcp wlwa (he doeinneat ■ 
•O- d«»a.r.f»T«.»«.«.-i.c.o««.««,.^n>i.io.o,od« ^^-^tT Sl^.'^^- 

*P* docimcBtpufctkkd priori Ilia ialen»iiaiMdriliii«daU but b«CT«hw *&* dacoDcal member of «w «k pueat fmily 


Date of the actual completion of the international search 
21 JUNE 1995 


Date of ntailing of the international search report 

lO JUL 1395 


Name and mailing address of the ISA/US 
Cominiinoaer of PUMtsaad TndecaMfcs 
BoxFCT 

WasUagtoo. D.C. 20231 
Facsimile No. Cms 305-3230 


Authorized officer |(J-}7\//>/? /^^^ 

JAMES MARTINELL 
Teleohone No. a03) 308-0196 



Fonn PCT/lSA/210 (second sheelKJuly 1992)« 



INTERNATIONAL SEARCH REPORT 



Intemaiional application Nn. 
PCT/US95/03918 



B. nELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

ENTREZ 16.0: two-hybrid, two, hybrid, yeast, Saccharomyces. gal.... reverse.... gal.... tex... 
APS: two. hybrid. 435/clas, 935/cIaj. SOO/clas, galSO. gal{w)80. lexo. lex(w)o 



Fbtm PCT/ISA/210 (extn iheetXJuly 1992)* 



